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The present invention relates to combinatorial libraries and to 
methods for making and using same. 



required the iterative synthesis, testing, and modification of individual compounds based 
on natural product leads or de novo molecular design. The emerging field of 
combinatorial synthesis shows great promise for streamlining this process, enabling the 
rapid generation of structurally diverse libraries of up to millions of compounds, which 
may be screened against a biological receptor (Gordon et al., "Applications of 
Combinatorial Technologies to Drug Discovery. 2. Combinatorial Organic Synthesis, 
Library Screening Strategies, and Future Directions" J. Med. Chem ., 37:1385-1401 
(1994); Virgilio et al., "Simultaneous Solid-Phase Synthesis of beta-Turn Mimetics 
Incorporating Side-Chain Functionality" J. Am. Chem. Soc , 116:11580-11581 (1994); 
Brenner et al., "Encoded Combinatorial Chemistry" Proc. Natl. Acad. Sci. USA , 89:5381- 
5383 (1992); Ohlmeyer et al., "Complex Synthetic Chemical Libraries Indexed With 
Molecular Tags" Proc. Natl. Acad. Sci. USA . 90:10922-10926 (1993); Carell et al., "New 
Promise in Combinatorial Chemistry: Synthesis, Characterization, and Screening of 
Small-Molecule Libraries in Solution," Chemistry & Biology , 2:171-183 (1995)). 

However, this approach does not represent a complete solution to the 
problem of identifying receptor ligands, for several reasons. 

First, unless the library is des igned to "tu ne^ the activity of a lead 
compound, theupr<5portion of active species relative to the whole is extremejy__srriaH. This 



has the effect of rendering detection of the active compounds against the background of 



FIELD OF THE INVENTION 



BACKGROUND OF THE INVENTION 



Until recently, the development of ^ilgand for a biologicaTrfoggptor has 
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are not meani ngful when _aBplied-to-rnixtures.of-Compounds. Typically, the detection 
problem has been addressed by coupling the receptor to a reporter molecul^ such as a 
dye, a fluorescent tag, or a substrate for another, readily detectable protein (e.g., coupling 
to biotin for detection by a streptavidin-alkaline phosphatase conjugate). However, these 
5 techniques, particularly the two-stage assay process, can change the properties of the 
receptor of interest, or cause 'ligands to the secondary receptor rather than the primary 
receptor to be identified (Devlin et al., "Random Peptide Libraries: A Source of Specific 
Protein Binding Molecules" Science 249:404-406 (1990); Lam et al., "A New Type of 
Synthetic Peptide Library for Identifying Ligand-Binding Activity" Nature (London), 

10 354:82-84(1991)). 

Second, the total amount of active compound present in an assay of this 
type is generally too small to permit direct structural analysis by spectroscopic means, 
requiring instead that the compound be identified through the{ analysis of encodedTag^ > or 
iterative deconvolution (although some direct methods employing gel-phase NMR or 

15 mass spectrometry (Youngquist et al., "Generation and Screening of Combinatorial 

Peptide Libraries Designed for Rapid Sequencing by Mass Spectrometry" J. Am. Chem. 
Soc . 1 17:3900-3906 (1995) have appeared), and subsequent resynthesis for full 
characterization. Finally, if library synthesis and evaluation is carried out on solid 
support,<^^pound behavior is not necessarily transferable to the soluti on ph asfe. On the~ 

20 other handrsolutimPbaSea'strategi^>for library synthesis require that analysis proceed 



through the time-consuming process cx fjterative deconvolutiorr^ (Erb et al., "Recursive 
Deconvolution of Combinatorial Chemical Libraries" Proc. Natl. Acad. Sci. USA , 
91:11422-11426(1994)). 

The present invention is directed to overcoming these and other 
25 deficiencies in the art. 

SUMMARY OF THE INVENTION 



The present inv ention relates to a^c ombinatorial^ibr^ y which includes a 
30 plurality of aHeast six differe nt complexes.^ Each of the complexes is formed of at least 
^one complexing a^'nt S Tfl jat least two non-biopolymer Hganfl^ that are reversibly bonded 
to the complexing agent, and each different complex in the library has different ligands 
bonded to the complexing agent. 



The present invention also relates to a composition that includes a 
combinatorial library and a receptor in contact with the combinatorial library. In this 
composition, some of the complexes bind preferentially to the receptor. The 
combinatorial library includes a plurality of at least six different complexes. Each of the 
complexes is formed of at least one complexing agent and at least two non-biopolymer 
ligands that are reversibly bonded to the complexing agent, and each different complex in 
the library has different ligands bonded to the complexing agent. 

The present invention also relates to a method for identifying a 
combination of non-biopolymer ligands which bind preferentially to a receptor. The 
method includes providing a combinatorial library; contacting the combinatorial library 
with a receptor under conditions effective to preferentially bind a fraction of the plurality 
of complexes; and identifying the fraction of the plurality of complexes which are bound 
preferentially to the receptor. In this method, the combinatorial library includes a plurality 
of at least six different complexes. Each of the complexes is formed of at least one 
complexing agent and at least two non-biopolymer ligands that are reversibly bonded to 
the complexing agent, and each different complex in the library has different ligands 
bonded to the complexing agent. 

The present invention also relates to a methods for producing a 
combinatorial library which includes a plurality of at least six different complexes. Each 
of the complexes in the library produced by these methods is formed of at least one 
complexing agent and at least two non-biopolymer ligands that are reversibly bonded to 
the complexing agent, and each different complex in the library has different ligands 
bonded"tolhevComplexing agent. Furthermore, each of the plurality of complexes has the 
formula^A%^^here Z is a complexing agent capable of reversibly binding to two or , 
more ligands, each A 1 is a non-biopolymer ligand capable of reversibly binding to Z and 
is independently selected from a group of non-biopolymer ligands having at least three 
different members, n is the number of A's that are reversibly bonded to Z and is an integer 
equal to two or greater, and i is an index number for each A and is an integer from 1 to n. 

In one method for producing a combinatorial library to which the present 
invention relates, one part of a complexing agent, Z, or a salt thereof is contacted with m 1 
parts of-A^and-m 1 parts of A 1 under conditions effective to form a substantially statistical 
mixture q^ Z^'^? ^ 1 and each A' are non-biopolymer ligands; i is an integer from 2 to k; 
k is an integer equal to or greater than 3 and represents the number of members in the 



group of at least three different non-biopolymer ligands from which the at least two non- 
biopolymer ligands are selected; and the sum of m 1 and Z m l from i=2 to k equals n. 

In another method for producing a combinatorial library to which the 
present invention also relates, a complex, Z(A x ) n is contacted with two or more 
complexes Z(A') n under conditions effective to form a substantially statistical mixture of 
Z(A ! ) n . A 1 and each A 1 are non-biopolymer ligands; i is an integer from 2 to k; and k is an 
integer equal to or greater than 3 and represents the number of members in the group of at 
least three different non-biopolymer ligands from which the at least two non-biopolymer 
ligands are selected. 

The combinatorial libraries, compositions, and methods of the present 
invention permit the selection and amplification of non-biopolymeric molecules which are 
targeted to a particular receptor, where the selection and amplification criteria are based 
primarily on differences in binding affinity to a receptor. Thus, the present invention has 
utility for the identification and preparation of non-biopolymeric molecules which are 
targeted to a particular receptor. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a bar graph showing the relative amounts of various ligands in 
initial elutions in the presence of various concentrations of zinc ion and in the presence 
and absence of salicylaldehyde. 

Figure 2 is a bar graph showing the relative amounts of various ligands in 
initial elutions from unfunctionalized cellulose resin in the presence of various 
concentrations of zinc ion and in the presence and absence of salicylaldehyde. 

Figure 3 is a graph showing the dependence of UV absorbance of a mixture 
of salicylaldehyde and N-methyl 2-aminoethyl pyrrolidine on the zinc ion concentration. 

Figure 4 is a bar graph showing the peak volumes for libraries prepared in 
the presence and absence of zinc ion. 

Figure 5 is a bar graph showing the effect of zinc ion presence on the peak 
area of various complexes contained in libraries according to the present invention eluted 
from DNA-cellulose. 
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Figure 6 is a bar graph showing the effect of zinc ion presence on the peak 
area of various complexes contained in libraries according to the present invention eluted 
from control cellulose. 

Figure 7 is a schematic diagram showing a process according to the present 

5 invention. 

Figure 8 is a bar graph showing the results of affinity selection and 
amplification of a self-assembled combinatorial library of the present invention using a 
double-stranded DNA receptor. 

Figure 9 is a bar graph showing the results of affinity selection and 
1 0 amplification of a self-assembled combinatorial library of the present invention using a 

single-stranded DNA receptor on cellulose and the results of a control selection on cellulose 
JJr alone. 

C8 Figure 1 0 is a bar graph showing the ratio of a pair of complexes in a 

combinatorial library of the present invention following incubation with double-stranded 
1 5 DNA in the presence and absence of zinc. 

Figure 1 1 is a series of NMR spectra showing the effect of the ratio of a 
ligand to ZnCh on the NMR spectrum. 
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===»= 



20 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention relates to a combinatorial library which includes a 
plurality of at least six different complexes. Each of the complexes is formed of at least 
one complexing agent and at least two non-biopolymer ligands that are reversibly bonded 
to the complexing agent, and each different complex in the library has different ligands 
25 bonded to the complexing agent. Two ligan dsjvhich are structurally foe same but whic h 
have different optical co nfigurations are to be considered "different" for purposes of th e 
present invention. 



As used herein, a ^gmplexing agenls^s meant to include any atom, group of 
atoms, or ion that is capable of reversibly binding two or more non-biopolymer ligands. 
30 Examples of suitable complexing agents include metal atoms and metal ions, which, as 
used herein, are meant to include Na, t^^^Cs, Fr, Mg, Ca, Sr, Ba, Ra, Sc, 
lanthanides, the actinides, Ti, Zr, Hf, V, Nb, Ta, Cr, Mo, W, Mn, Tc, Re, Fe,(Ru)osKCp, 
Rh, Ir, Ni, Pd, Pt, Cu, Ag, Au, Zn, Cd, Hg, B, Al, Ga, In, Tl, Si, Ge, Sb, Pb, P, As, Sb, Bi, 
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Se, Te, Po, and their stable ions. Preferably, the metal or metal ion complexing agent has 
a valency such that it undergoes fast ligand exc hange. Considerations for selecting such 
preferred metal or metal ion complexing agents based on this criterion are known to those 
of ordinary skill in the art, and further detail regarding these considerations is are set forth 
5 in Cotton et al., Advanced Inorganic Chemistry. A Comprehensive Text , 4th ed., New 
York: John Wiley & Sons (1980) ("Cotton"), which is hereby incorporated by reference. 
For example, since Co(II) undergoes fast ligand exchange while Co(III) does not, the use 
of Co(II) as a complexing agent is preferred to Co(III) in the practice of the present 
invention. 

10 As used herein, a^ligand is\neant to include any compound that is cap ably 

of reversibly binding to the complexing agent. A non-biopolymer ligand is meant to 
include any ligand provided that the ligand is not a DNA molecule, an RNA molecule, or 
a polypeptide. For purposes of the present invention, DNA and RNA molecules are : 
compounds that consist solely of two or more nucleotides linked by phosphodiester 

15 bonds, and polypeptides are compounds that consist solely of two or more amino acids 
linked by amide bonds. Other biologically based compounds, such as cellulose, starches, 
sugars, and the like are considered to be non-biopolymer compounds for the purposes of 
the present invention. 

^Non-biopolymer ligancfe suitable for use in the practice of the present 

20 invention are those which reversibly bind to the complexing agent. Where the 

complexing agent is a metal atom or metal ion, suitable ligands are those compounds that 
have at least one functional group capable of bonding to a metal or metal ion. Examples 
of such functional groups include liydroxyl grou ps^ nitrogen-containing groups 
(including, e.g., amino groups and heterocyclic nitrogens), carboxyl groups (e.g., ketones, 

25 aldehydes, and carboxylic acids and derivatives thereof) and the like. Illustratively, the 
following structures represen ^suitable non-biopolymer ligands ^) 





OH O 
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where R, R', R", and R 1 ", when bonded to a ring, represent one or more ring substituents 
and, when bonded to a nitrogen atom, represent a single substituent. Suitable ring 
substituents include, halogen, hydroxy, substituted or unsubstituted alkyl, alkoxy, 
aryloxy, substituted or unsubstituted aryl, an amino group (which may, optionally, be 
substituted with one or two alkyl or aryl groups), a carboxylic acid, ester, or amide group, 
and a substituted or unsubstituted heterocycle (e.g., a substituted or unsubstituted pyridyl, 
furanyl, piperidinyl, or morphilino group). Suitable nitrogen substituents include 
substituted or unsubstituted alkyl, substituted or unsubstituted aryl, or substituted or 
unsubstituted heterocycle (e.g., a substituted or unsubstituted pyridyl, furanyl, piperidinyl, 
or morphilino gjpup)r 

C Prefer^blyffie non-biopolymer ligand also contains one or more groups 
which are capable of recognizin g a biological element, such as a D NA molecule or a 
particular portion of a DNA molecule. Such a group will be referred to herein as a 



15 



'^recognition e lemen ^ Examples of such recognition elements i nclude DNA intercalators, 
such as those which have the following formulae: 
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I '' Preferred intercalators are pyrenes and anthracenes. Other examples of such recognition 

I** 5 elements include DNA major or minor groove binders, such as those which have the 

f|J following formulae: 
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Other suitable recognition elements include hydroxy groups, pyrrolid-2-yl groups, N- 
alkylpyrrolid-2-yl (e.g., N-methylpyrrolid-2-yl) groups, alkoxy (e.g., methoxy) groups, 
tetrahydrofuran-2-yl groups, pyrid-2-yl groups, and substituted or unsubstituted phenyl 
(e.g., 4-fluorophenyl) groups. j? 

These recognition elements can be bonded directly to the above-described 
non-biopolymer ligands as one or more of the recited R, R", and R fM groups. 
Alternatively, they can be bonded through a suitable connector moiety to the above- 
described non-biopolymer ligands as one or more of the recited R', R", and R"' groups. 
Suitable connectors include substituted or unsubstituted alkylene moieties (e.g., -CH 2 - ? 
-CH 2 CH 2 -, -CH 2 CH 2 CH 2 -, -CH 2 CH(CH 3 )CH 2 -). Other suitable connectors include 
moieties having the formulae: 
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~jz. 



which may be optionally substituted (e.g., with alkyl or aryl groups) at the carbons or 
nitrogens to which hydrogens are bonded. 

The above-discussed non-biopolymer ligands can be expressed in terms of 
the following formula: 



Core 




Connector 




Recognition Element 







where the recognition element and connector are as described above and where suitable 
core moieties are those above-described structures which have one or more R, R, R", and 
R m groups bonded thereto. As one skilled in the art will appreciate, a core can be 
^ connected to more than one connector, and each connector may, in turn, be attached to 

ski 1 0 one or more reeogmfi^^lemgnts. 

S uitable. non- biopolymer ligands)also can include simple molecules, such 
as, for example, water, amines, alcohols, thiols, and carboxylic acids and deri vativ es 
C3 thereof Amines, alcohols, and thiols that can be used as non-biopolymer ligands in the 



I ' practice of the present invention can be primary or secondary amines, alcohols, and thiols. 

15 Compounds containing two or more of these functional groups (e.g., ethylenediamine) 
f|j can also be employed. Preferably, at least one of the non-biopolymer ligands contains a 

*IJ group that is capable of recognizing a biological element, such as a DNA molecule or a 

particular portion of a DNA molecule. 

As indicated above, the combinatorial library of the present invention 
20 includes a plurality of at least six different complexes and each of the complexes is 

formed of at least one complexing agent and at least two non-biopolymer ligands that are 
reversibly bonded to the complexing agent. In addition, the combinatorial library 
includes at least six different complexes and each different complex in the library has 
different ligands bonded to the complexing agent. 
25 For example, the combinatorial library can include a plurality of 

complexes, each of which has the formula Z(A ! ) n? where Z is a complexing agent capable 
of reversibly binding to two or more ligands, each A 1 is a non-biopolymer ligand capable 
of reversibly binding to Z, and n is an integer equal to two or more and represents the 
number of non-biopolymer ligands that are bonded to Z. In the above formula, i is an 
30 index number which identifies each of the ligands individually (e.g., A 1 , A 2 , A 3 , A 4 , etc) 
and can have values from 1 to n. Each A', is independently selected from a group of non- 
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biopolymer ligands having at least three different members. Illustratively, when the 
number of non-biopolymer ligands that are bonded to Z is equal to 2, n will be 2, and 
there will be two A's (i.e., A 1 and A 2 ); when the number of non-biopolymer ligands that 

12 3 

are bonded to Z is equal to 3, n will be 3, and there will be three A's (i.e., A , A , and A ); 
when the number of non-biopolymer ligands that are bonded to Z is equal to 4, n will be 
4, and there will be four A's (i.e., A 1 , A 2 , A 3 and A 4 ); when the number of non- 
biopolymer ligands that are bonded to Z is equal to 5 5 n will be 5, and there will be five 
A's (i.e., A 1 , A 2 , A 3 A 4 , and A D ); when the number of non-biopolymer ligands that are 
bonded to Z is equal to 6, n will be 6, and there will be six A's (i.e., A 1 , A 2 , A 3 A 4 , A\ and 
A 6 ); when the number of non-biopolymer ligands that are bonded to Z is equal to 7, n will 
be 7, and there will be seven A's (i.e., A 1 , A 2 , A 3 A 4 , A 5 , A 6 and A 7 ); and when the 
number of non-biopolymer ligands that are bonded to Z is equal to 8, n will be 8, and 
there will be eight A's (i.e., A 1 , A 2 5 A 3 A 4 , A 5 , A 6 A 7 , and A 8 ). Stated differently, eacfrof 
the plurality of complexes can be said to have the formula Z(A 1 )(A 2 )(A') n _2, where A 1 and 
A 2 are non-biopolymer ligands capable of reversibly binding to Z and are independently 
selected from a group of non-biopolymer ligands having at least three different members. 
In this case, (A ! ) n _2 represents the remaining (i.e., other than A 1 and A 2 ) n-2 non- 
biopolymer ligands that are bonded to Z. 

As one skilled in the art will appreciate, the values of n and i, as used * 
herein, are not to be construed as an indication of the total number of A's in the group 
(e.g., ligands in the library) from which the A's are selected. Instead, n refers to the 
number of ligands bonded to the complexing agent, and i is an index number for each of 
those ligands. 

As indicated above, each non-biopolymer ligand is independently selected 
from a group of non-biopolymer ligands having at least three different members. For 
purposes of illustration, the at least three different members from which each non- 
biopolymer ligand is independently selected will be designated Bl, B2, and B3. Thus, 
when n non-biopolymer ligands are bonded to Z, the combinatorial library can include 
complexes having the formulae Z(B1)(B1)(AV2, Z(B1)(B2)(AV 2 > Z(B1)(B3)(AV2, 
Z(B2)(B2)(AV 2 , Z(B2)(B3)(A i ) n . 2 , and Z(B3)(B3)(A i V 2 . For example, when n is 2, the 
combinatorial library includes complexes having the formulae Z(B1)(B1), Z(B1)(B2), 
Z(B1)(B3), Z(B2)(B2), Z(B2)(B3), and Z(B3)(B3). While the minimum number of 
members in the group from which non-biopolymers are selected is 3, this number can be 



higher (e.g., 4, 5, 6, 7, 8, 9, 10, or more). For example, when the number of members in 
the group from which non-biopolymers are selected is 4 (e.g., Bl, B2, B3 5 and B4) and 
the number of non-biopolymer ligands bound to each Z is 2, the combinatorial library can 
include complexes having the formulae Z(B1)(B1), Z(B1)(B2), Z(B1)(B3), Z(B1)(B4), 
Z(B2)(B2), Z(B2)(B3), Z(B2)(B4), Z(B3)(B3) Z(B3)(B4), and Z(B4)(B4). Generally, 
when the number of members in the group from which non-biopolymers are selected is m 
(e.g., Bl, B2, B3, ... Bm), and the number of non-biopolymer ligands bound to each Z is 

2, the combinatorial library will contain (m 2 +m)/2 different complexes. As indicated 
above, the minimum number of non-biopolymer ligands which are bound to each Z is 2. 
However, depending on the nature of Z and of the non-biopolymer ligands present, the 
number of non-biopolymer ligands which are bound to each Z can be greater than 2 (e.g., 

3, 4, 5, 6, 7, 8, or more). For purposes of the present invention, a ligand bonded to Z is 
counted only once, even if it is bonded through two or more groups (such as where the 
ligand is a bidentate, tridentate, or tetradentate ligand). 

Also included in the meaning of non-biopolymer ligands, A, are moieties 
having the formula -A'Z'(Q), where A' is a pluridentate (i.e., bidentate, tridentate, etc.) 
non-biopolymer ligand, Z' is a second complexing agent that is identical to or different 
than Z, and Q represents one or more non-biopolymer ligands. Thus, for example, when 
each Z is bonded to two non-biopolymer ligands (i.e., n=2) selected from a group of 
ligands containing at least one pluridentate (i.e., bidentate, tridentate, etc.) ligand, A', and 
other ligands, the combinatorial library of the present invention can contain complexes 
having the formula: 

Q z A : 2 Q 

where each Q is independently selected from the group of ligands (i.e., Q can be A* or one 
of the other ligands in the group). 

In the case where each Z is bonded to three non-biopolymer ligands (i.e., 
n=3) selected from a group of ligands containing at least one pluridentate (i.e., bidentate, 
tridentate, etc.) ligand, A', and other ligands, the combinatorial library of the present 
invention can contain complexes having the formula: 
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where each Q is independently selected from the group of ligands (i.e., Q can be A* or one 
of the other ligands in the group). 



non-biopolymer ligands, A, can be moieties having the formula -A'Z'(Q) ? where each A' 
is the same or different and is a pluridentate (i.e., bidentate, tridentate, etc.) non- 
biopolymer ligand, each Z' is a second complexing agent that is identical to or different 
than each other and Z, and Q represents one or more non-biopolymer ligands. Thus, for 
example, in the case where each Z is bonded to three non-biopolymer ligands (i.e., n=3) 
selected from a group of ligands containing at least one pluridentate (i.e., bidentate, 
tridentate, etc.) ligand, A', and other ligands, the combinatorial library of the present 
invention can contain complexes having the formula: 



where each Q is independently selected from the group of ligands (i.e., Q can be A' or one 
of the other ligands in the group) and each A f can be the same or (in the case where the 



when each Z is bonded to three non-biopolymer ligands (i.e., n=3) selected from a group 
of ligands containing at least one pluridentate (i.e., bidentate, tridentate, etc.) ligand, A', 
and other ligands, the combinatorial library of the present invention can contain 
complexes having the formula: 



In still another embodiment of the present invention, two or more of the 



Q 



Q 



■Z A' Z A Z Q 



group of ligands contains more than one pluridentate ligand) different. Alternatively, 



Q Z, 




7- 1 



Q 



where each Q is independently selected from the group of ligands (i.e., Q can be A' or one 
- of the other ligands in the group) and each A ? can be the same or (in the case where the 
group of ligands contains more than one pluridentate ligand) different. Stated more 
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generally, where the combinatorial library contains complexes having the formula 
Z(A 1 )(A 2 )(A , ) n -2, A 1 and A 2 , taken together, can represent a moiety having the formula: 

J' 



A 



Z"(Q) 



where A 1 and A 2 are pluridentate (i.e., bidentate, tridentate, etc.) ligands and are the same 
or (in the case where the group of ligands contains more than one pluridentate ligand) 
different. In these cases, Z(A , )(A 2 )(A i ) n . 2 has the formula: 



(AV 2 z^^ ^z\q) 

A 2 ' 

As indicated above, the non-biopolymer ligands, A, bound to the 
complexing agent, Z, are bound reversibly. Thus, for example, suitable complexes are 
those which can undergo ligand exchange. Preferred complexes are those where Z and 
each A are selected so that the reactions Z(A , ) n _i + A 1 Z(A l )(A , ) n -i and Z(A 1 )(A , ) n -i 
Z(A') n _i + A 1 each have a rate constant of greater than about 2 per second, more 
preferably, greater than about 4 per second, and, most preferably, greater than about 10 
per second. Rate constants for these reactions can be measured by conventional methods, 
which are well known to those skilled in the art, such as, for example, by the methods 
described in Cotton, particularly in Chapter 28, which is hereby incorporated by 
reference. 

As indicated above the combinatorial libraries of the present invention 



contain at least six different complexes. For purposes of the present invention, two 



complexes are different if one complex contains a lig and whicH"is not present'lnTEe ot her 
complex. As one skilled in the art will recognize the number of complexes in the 
combinatorial library will depend on the number of ligands present in the group of ligands 
from which the ligands are selected and on the number of ligands which can bind to a 
single complexing agent. Preferably, the combinatorial library contains at least about 20 
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different complexes; more preferably, the combinatorial library contains at least about 40 
different complexes; still more preferably, the combinatorial library contains at least 
about 60 different complexes; still more preferably, the combinatorial library contains at 
least about 100 different complexes; still more preferably, the combinatorial library 
5 contains at least about 250 different complexes; and most preferably, the combinatorial 
library contains at least about 500 different complexes. 

The present invention also relates to a composition which includes a 
combinatorial library and a receptor in contact with the combinatorial library. 
Combinatorial libraries suitable for the practice of the present invention include those 

1 0 described hereinabove. Suitable receptors include, for example, biological receptors, 
such as, DNA molecules, RNA molecules, oligosaccharides, whole cells or fragments 
thereof, proteins and other polypeptides, and polymers. Typically, the receptor will be 
one having particular biological interest, for example, a particular protei n, RNA 
molecule, or DNA molecule that is linked to the occurrence of a particular disease, 

1 5 disease state, or condition. Many such proteins, RNA molecules, or DNA molecules are 
known to those of ordinary skill in the art. 

The qe ceptor^ i n contact with the combinatorial libra gy. Typically, this is 

achieved by dissolving or suspending the constituent parts of the combinator ial librar y in 

— - 

a suitable medium (e.g., water or a buffered aqueous solution) which is in contact with the 
20 receptor. The receptor can be also dissolved or suspended in the medium or it can be 

immobilized on a solid support, such as cellulose or related resins. Advantageously, the 
receptor is immobilized on a solid support material that is packed into a column, and the 
combinatorial library is suspended or dissolved in a medium (e.g., water or a buffered 
aqueous solution) which is present in the column and in contact with the solid support 
25 material on which the receptor is immobilized. Immobilization of biological receptors on 
solid supports can be carried out using conventional methods, which are well known to 
those of ordinary skill in the art. As one of ordinary skill in the art will recognize, where 
temperature or light sensitive biological receptors are employed, appropriate measures 
should be taken to prevent denaturation of these biological materials. Upon initial contact 
30 of the receptor with the combinatorial library, few if any of the receptors will be bound by 
the complexes contained in the library; however, over time, an increasing number of the 
receptors will be bound by these complexes. Thus, the composition of the present 
invention may contain receptors which are bound to complexes contained in the 
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combinatorial library. Generally, one skilled in the art will be able to predict the types of 
ligands which will bind to a particular biological receptor, and the combinatorial library 
will be designed accordingly. For example, where the biological receptor is a DNA 
molecule, one skilled in the art would identify DNA intercalators or major or minor 
5 groove binders as suitable ligands for construction of the combinatorial library. 

The combinatorial libraries of the present invention are useful for 
identifying a combination of non-biopolymer ligands which binds preferentially to a 
receptor, such as a biological receptor, examples of which include those provided above. 
As used herein, the term "identifying" is meant to include "preparing" and "isolating" 
10 even though the identity of the desired combination may already be known, 
f The method includes providing a combinatorial library of the present 

^ invention, such as one of the combinatorial libraries described hereinabove, and 

Cil contacting the combinatorial library with a receptor under conditions effective to 

[7 pre ferentially bind a fraction of the plurality of complexes present in the combinatp rial 

w 15 library to the receptor. Generally, the combinatorial library is present as a suspension or 

= solution in water or some other suitable solvent, and the receptor is either dissolved or 

suspended in the water or other suitable solvent or immobilized, for example, on a 
Til suitable resin. 

*h Contacting is carried out under conditions which are effective to 

^ 20 preferentially bind a fraction of the plurality of complexes present in the combinatorial 

library. For example, the temperature at which the contacting is carried out is preferably 
between about 0°C and 40°C, although other temperatures can be employed so long as the 
biological receptor is stable at these temperatures and so long the ligand exchange rates 
(i.e., the rate at which the reactions Z(A\.\ + A 1 -» ZCA'XA'Vi and ZCA'XAVi -> 
25 Z(A') n _i + A 1 occur) remain reasonably fast (preferably greater than about 2 per second). 
Physiologically relevant conditions (e.g., about 1-100 mM N-(2- 

hydroxytethyl)piperazine-N'(2-ethanesulfonic acid) ("HEPES"), about 10-1000 mM KC1, 
and pH from about 7 to about 8) are generally suitable conditions for carrying out the 
contacting step of the method of the present invention, especially where biological 
30 receptors are involved. The time period for which the combinatorial library and receptor 
is contacted can vary, but, generally, longer times give rise to better results in terms of 
the fraction of receptors that are bound to the preferentially-bound complex. Suitable 
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time periods for maintaining contact between the combinatorial library and receptor are 
from about 10 minutes to about 5 days, preferably from about 1 hour to about 8 hours. 

As a result of the above-described method, the receptors are preferentially 
bound (preferably, substantially exclusively bound) to the complex having the highest 
5 affinity (i.e., the greatest equilibrium constant) therefor. Most if not all other complexes 
remain unbound. A complex can be considered to be preferentially bound to a receptor if, 
after equilibrium is established, the complex occupies greater than about 50%, preferably, 
greater than about 75%, and more preferably, greater than about 90% of the available 
receptor sites. 

10 Although not wishing to be bound by the following proposed explanation, 

applicants believe that the observed phenomenon can be explained by virtue of the fact 
that the bin ding between the ligands and the compl ejringjj gent is reversibl e, thus 
permitting ligands to exchange in response to perturbations in the ligand/complexing 
agent equilibrium. By adding a receptor, however, an additional equilibration ensues. This 

1 5 equilibration involves the binding of the complexes to the receptor with varying degrees of 
affinity. Since some complexes bind with higher affinity than others, those complexes 
would be depleted from the pool of equilibrating complexes. By simple mass balance rules, 
the equilibrium would then have to shift in favor of complexes that bind to the receptor, 
effectively using the thermodynamics of ligand-receptor binding to direct the synthesis of 

20 the complex having the highest affinity for the receptor. In the end, assuming there is 
sufficient amounts of the ligands necessary to produce a sufficient amount of highest 
affinity complex, the receptor is saturated with the highest affinity complex. 

The preferentially bound complex can then be identified or characterized, 
such as by conventional methods (GC-MS, HPLC, etc). Typically, this involves separating 

25 the preferentially bound complex from the other complexes that are in the combinatorial 

library. Since the preferentially bound complex is attached to the receptor, this is generally 
achieved by separating the receptor (with preferentially bound receptor attached thereto) 
from the other complexes that are in the combinatorial library. In the cases where the 
receptor is immobilized, for example, as described above, such separation can be readily 

30 achieved, for example, by elution, filtration, and the like. In cases where the receptor is 

suspended or dissolved in a medium (e.g., a solvent) containing the other complexes in the 
combinatorial library, separation can be effected by filtration, polyethylene glycol 
precipitation, and the like. Once separation is achieved, the preferentially bound complex 
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can be readily identified using conventional characterization methods. In some cases, it 
may be advantageous to separate the preferentially bound complex from the receptor to 
which it is bound. This can be effected by denaturation of the receptor or by elution from 
the receptor. Preferably, such a separation should be carried out under conditions which 
disfavor ligand exchange. In some situations, this can be achieved by changing the 
oxidation state of the metal complexing agent so that, after the change, the metal 
complexing agent is non-labile (i.e., undergoes ligand exchange very slowly). Further 
detail with regard to controlling the lability of a complexing agent is set forth in Cotton, 
particularly in Chapter 28, which is hereby incorporated by reference. 

The combinatorial libraries described and used above can be prepared by the 
following method to which the present invention also relates. In this method, one part (by 
mole) of a complexing agent, Z (or a salt thereof), is contacted with m 1 parts of A 1 and m 1 
parts of A 1 . A 1 and each A 1 are non-biopolymer ligands, examples of which are described 
hereinabove; i is an integer from 2 to k; k is an integer equal to or greater than 3 and 
represents the number of members in the group of different non-biopolymer ligands from 
which the non-biopolymer ligands are selected; and the sum of m 1 and £ m 1 from i=2 to k 
equals n. For example, where the ligands are selected from a group of 4 different ligands 
(i.e., k=4), these ligands can be identified as A 1 , A 2 , A 3 , and A 4 . Each of A 1 , A 2 , A 3 , and 
A 4 , are present in amounts (expressed in parts by mole) of m 1 , m 2 , m 3 , and m 4 , 
respectively, and the sum ofm 1 , m 2 , m 3 , and m 4 is equal to n. Generally, the amounts of 
the different non-biopolymer ligands in the reaction mixture will be substantially the 
same. Illustratively, returning to the above example and assuming that the number of 
ligands which bind to each complexing agent is 2, each of m 1 , m 2 , m 3 , and m 4 can be 
about 0.5. The complexing agent and non-biopolymer ligands are contacted under 
conditions effective to form a substantially-statistical mixture of complexes. Suitable 
conditions are those which permit equilibriums^) be established between the various 



ligands and the complexing agent. Typically, this involves carrying out the reaction at a 
temperature of from about 0°C and 100°C, preferably about room temperature, for a time 
sufficient for equilibration to occur (e.g., about 10 minutes to about 5 days, preferably 
about 1 to about 2 hours). Either or both of the ligands and complexing agent can be 



present as^alsuspension or as a solution, but the latter is preferred. Although it is not 



generally necessary, the statistical proportions of the various complexes in the resulting 
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combinatorial library can be calculated based on the equilibrium constants of Z and each 
different ligand and the proportions of the different ligands present in the mixture. 

The combinatorial libraries described and used above can also be prepared 
by the following alternative method to which the present invention also relates. In this 
5 method, a complex, Z(A x ) n , is contacted with two or more complexes Z(A') n under 

conditions effective to form a substantially statistical mixture of Z(A l ) n . A 1 and each A 1 
are non-biopolymer ligands; i is an integer from 2 to k; and k is an integer equal to or 
greater than 3 and represents the number of members in the group of different non- 
biopolymer ligands from which the ligands A 1 and A 1 are selected. In this embodiment, 

10 each of the n A !, s in Z(A ! ) n is the same, and each of the n A"s in each Z(A ! ) n is the same. 
The Z(A ! ) n and Z(A')n complexes are contacted under conditions effective to form a 
substantially statistical mixture of complexes. Suitable conditions are those which permit 
the ligands on the Z(A ! ) n and Z(A') n to re-equilibrate. Such conditions include carrying 
out the reaction at a temperature of from abg.ut-0-G~and 100°C, preferably-about room 

1 5 temperature;-carrying^out the reaction in a gommon solve nt) for examp^^atet)or a 
buflfered^aqueous soluti^nTahd carrying out the reaction for a time sufficient for re- 



equilibration to occur (e.g., about 10 minutes to about 5 days, preferably about 1 to about 
2 hours). Either or both of the Z(A ! ) n and Z(A') n can be present as a suspension or as a 
solution, but the latter is preferred. Although it is not generally necessary, the statistical 
20 proportions of the various complexes in the resulting combinatorial library can be 
calculated based on the equilibrium constants of Z and each different ligand and the 
proportions of the various Z(A*) n and Z(A , ) n initially in the mixture. 

The following examples are provided to illustrate embodiments of the 
present invention but are by no means intended to limit its scope. 
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EXAMPLES 



Example 1 — General Materials and Methods 

Salicylaldehyde, 3-methoxypropylamine, 3 -hydroxy propylamine, 2- 
30 aminoethylpyridine, N-methyl 2-aminoethyl pyrrolidine, 2-aminomethylfuran, (4- 

fluoro)phenethylamine, trifluoroacetic acid, 2-naphthoyl chloride, and methylene chloride 
were obtained from Aldrich chemical company and used without further purification. 
Triethylamine was obtained from J. T. Baker Corp. and distilled over calcium hydride 
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under nitrogen atmosphere prior to use. HEPES and phosphate buffered saline ("PBS") 
buffers were prepared according to standard protocols from materials purchased from 
Sigma chemical company. Deionized water was subject to redistillation in an all-glass 
apparatus prior to use. Zinc chloride and zinc acetate were used as obtained from J. T. 
Baker Corp. 01igo(dT)-cellulose resin, oligo(dA), and poly(dA-dT) were obtained from 
Pharmacia Biochem. Unfunctionalized cellulose for control experiments was used as 
obtained from Sigma chemical company. HPLC data on derivatized amines were 
obtained using either a Beckman 1 12 ternary gradient HPLC system equipped with an 
Alcott fixed-fill autosampler, a Whatman EQ-C18 reverse-phase column (4.6 x 300 mm), 
and UV detection, or a Hewlett-Packard 1050 quaternary HPLC equipped with an 
autosampler, Hamilton RP-1 reverse-phase column (4.6 x 150 mm), and UV detection. 
LC-MS was carried out using a Hewlett-Packard Series 1 100 MSD, using electrospray 
ionization in positive ion mode and a Whatman EQ-C18 reverse-phase column. Mass;i 
spectral analysis of zinc complexes was performed by direct injection (bypassing the 
column) of samples into the Hewlett-Packard Series 1 100 MSD, with electrospray 
ionization operating in positive ion mode. UV-Vis spectra and kinetics experiments were 
carried out on a Shimadzu 1601 -PC spectrophotometer at ambient temperature (23°C +/- 
1°C). 

Example 2 — Preparation of DNA Affinity Resins 

For each sample, 1 5 mg oligo(dT) cellulose was weighed into a 2 ml Bio- 
Rad BIOSPIN™ polypropylene chromatography column equipped with a glass frit. The 
resin was then taken up in 750 [il HEPES buffer (10 mM HEPES, 100 mM KC1, pH 6.8). 
0.5 A(260) units of oligo(dA) were then added in 50 (il H2O and allowed to incubate on the 
resin for one hour at ambient temperature prior to the addition of library samples. During 
this pre-incubation period, as well as throughout the remainder of the experiments, gentle 
agitation was provided by rotating the samples on a Barnstead-Thermolyne 
LABQUAKE™ rotary shaker. In a similar manner, 15 mg of unfunctionalized cellulose 
for each control experiment was allowed to soak in 800 ^il HEPES buffer at ambient 
temperature for one hour prior to the addition of library samples. 
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Example 3 — Preparation of Libraries 

Amines 1 to 6, having the following formulae: 




were incubated with salicylaldehyde. Since amines react with the salicylaldehyde to 
reversibly form imines, the following equilibria are established: 
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As one skilled in the are will note, the complexes shown above represent only a subset of 
the complexes that would be present in this equilibrium. Samples were prepared in 
triplicate from stock solutions as indicated in Table 1 to provide a total volume of 450 jil. 
An identical set of samples was prepared for control experiments. In each case, libraries 
5 were allowed to incubate at ambient temperature for one hour prior to addition to the 
DNA-cellulose or control cellulose resin. Additional sets of library samples were 
prepared according to lines C and D of Table 1 for control experiments conducted in the 
absence of DNA-cellulose or unfunctionalized cellulose resin. 



10 Table 1 

Library Samples, "High" Amine Concentration 

Sample Set Concentration Zn (2+) Salicylaldehyde 
of each amine Concentration Concentration 
"A 164 0 82 

B 164 41 0 

C 164 41 82 

D 164 82 82 

1 All samples were prepared in triplicate, as described in the text. Concentrations are in 
micromolar, and reflect the final concentration obtained following addition of the library 
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mixture to the DNA-cellulose or control resin. 



Example 4 — Incubation of Libraries With Affinity Resins 

Following the pre-incubation period, each sample was added to either a 
DNA-cellulose or unfunctionalized cellulose column as appropriate. These samples were 
20 then allowed to incubate on the resin with gentle agitation at ambient temperature for six 
hours. Solutions were then eluted from the resin, frozen, and lyophilized prior to 
derivatization and analysis as described below. Each resin sample was resuspended in 1 
ml H2O and agitated for 24 hours; these wash solutions were then eluted from the resin, 
frozen, and lyophilized prior to derivatization and analysis as described below. 



Example 5 — Derivatization of Samples 

Each sample was resuspended in 1.0 ml of a 50% trifluoroacetic acid 
solution in dichloromethane, vortexed, and allowed to react for one hour. Solvent was 
removed under reduced pressure. To each sample of dried material was then added a 
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solution of 0.017 mmol 2-naphthoyl chloride and 200 \xl triethyl amine in 0.8 ml 
methylene chloride. After allowing the reaction to proceed for 10 hours, solvent was 
removed under reduced pressure. Each crude sample was prepared for HPLC analysis by 
dissolution in 700 \il of a 20% H 2 0 - 80% CH 3 CN solution. 

5 

Example 6 -- HPLC Analysis of Samples 

In order to separate and identify as many constituents of each sample as possible, 
a minimum of two HPLC elutions were performed for each sample. In the first, a 30 
minute linear gradient of 30% to 1 00% CH 3 CN in H 2 0 - 0.1% trifluoroacetic acid was 
10 applied. In the second analysis, a linear gradient of 30% to 1 00% CH 3 CN in 10 mM 
?:) NH 4 OAc (pH 4.5) was applied. Elutions were monitored at 230 nm. HPLC peaks were 

?;f assigned by comparison with retention times obtained for standard solutions. 

iii Example 7 — Results 

~z±i 1 5 Average results for derivatized amines from initial elutions are shown in 

s Figure 1. For each case, areas for eluted peaks in the set of zinc-free samples (164 juM 

f*j each amine, 82 \iM salicylaldehyde, no Zn 2+ ) were used to normalize all other values. A 

Lit significant reduction in the amount of derivatized N-methyl 2-aminoethyl pyrrolidine (4) 

si) was observed from the three replicate experiments containing both salicylaldehyde and 41 

CO 2+ 

20 (J.M Zn . Similarly, a large (i.e., >50%) reduction in the amount of 3- 

hydroxypropylamine (6) was observed under these conditions. An unidentified impurity 

unfortunately obscured a peak corresponding to 5-naphthylamide, preventing the 

evaluation of the role of 5 in the library. Comparison with the initial elutions from 

unfunctionalized cellulose resin (Figure 2) indicates similar levels of retention of 6 but 

25 not 4. A slight (ca. 20%) reduction in the amount of 2-aminomethlyfuran (1) in 

salicylaldehyde and 41 |j.M Zn 2+ -containing libraries eluted from DNA-cellulose resin 
was observed. This may indicate a slight affinity of complexes containing 1 for DNA. 
These data indicate that zinc complexes incorporating salicylaldehyde and N-rnethyl 2- 
aminoethylpyrrolidine (4) have the highest affinity for oligo (dA»dT) at the relative 

30 concentrations tested. Libraries containing higher (82 [iM) concentrations of Zn 2+ gave 
markedly different results from those incorporating 41 \xM Zn 2+ . Most notably, a 
significant reduction in the amount of 4 retained on the column was observed. One likely 
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explanation for this observation is that the added zinc changes the population of 
complexes present in the library. However, it is not known at present what the precise 
structural reasons for this might be. As shown in Figure 3, a strong dependence of the 
UV spectra of mixtures of salicylaldehyde and N-methyl 2-aminoethyl pyrrolidine (4) in 
5 HEPES buffer on the concentration of Zn 2+ has been observed. However, the 
concentrations of Zn 2+ evaluated in these UV titrations which caused the inverse 
parabolic variations observed were 10-fold higher than those employed in the present 
affinity experiments. 

To verify that the presence of zinc was not producing artifacts in the 
10 ability to derivatize samples, control libraries, 164 |j,M in each amine, 82 jaM in 
C3 salicylaldehyde, and 0 or 41 jaM in Zn 2+ , were prepared. After allowing these libraries to 

i& incubate in solution (i.e., without contact with any sort of resin) for 6 hours, the libraries 

™ were lyophilized, derivatized, and analyzed in an identical manner to samples from , 

M selection experiments. As shown in Figure 4, differences in peak volumes for libraries 

jT'71 

f|| 15 prepared in the presence or absence of Zn 2+ are less than 10%, with the exception of 3- 
* methoxypropylamine (3). In each case, the differences could be due to experimental 

f*J error, participation of Zn 2+ in the acylation reaction, or by Zn 2+ -accelerated partial 

J=|; reversion by hydrolyzed complexes to imines prior to acylation. These results also 

%S| suggest that differences in peak volumes for derivatized amines that are greater than 10% 

CO 

20 are probably significant, with the exception of 3 -methoxypropylamine. 

A potentially more desirable method of identifying the library constituents 
with the highest affinity would be to remove them from the receptor in some manner: 
following the initial elution of "non-binding" components for subsequent derivatization 
and analysis. It was assumed that simply washing the DNA resin with water would be 

25 sufficient to denature the receptor, since low-salt conditions are known to cause DNA 
melting. However, only small amounts of derivatized amines from wash solutions were 
observed, with no experimentally significant differences among experiments or between 
DNA-cellulose (Figure 5) and control cellulose (Figure 6) resins. 
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Example 8 -- Alternative Method 

Rather than forming the imine constituents of the ligands from a mixture 
of salicylaldehyde and amines, the imines can be pre-formed, as set forth below. 
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The experimental design is shown in schematic form in Figure 7. 
Introduction of a transition metal salt ("M") to a pool of compounds ("monomers") capable 
of forming coordination compounds with M should initiate a series of equilibria among 
various combinations of complexes. Assuming ligand exchange is reasonably fast, this 
5 equilibration permits all combinatorially possible coordination complexes to be populated. 
By adding a receptor covalently linked to a solid support (an affinity resin), however, an 
additional equilibration ensues involving the binding of coordination compounds to the 
receptor with varying degrees of affinity. Since some compounds might be expected to 
bind with higher affinity, those compounds would be depleted from the pool of 
1 0 equilibrating complexes. By simple mass balance rules, the equilibrium would then have to 
shift in favor of compounds that bind to the receptor, effectively utilizing the receptor as a 
catalyst for the synthesis of its own ligand. 
~f;* While this concept could easily be applied to any type of biopolymer or 

other "receptor", double-stranded DN A was chosen as the initial target, for a number of 
}T\ 1 5 reasons. First, the design and synthesis of sequence-selective DNA-binding agents is a 

- problem of continuing fundamental and medical importance. (For example, see Trauger et 

fails 

Hi al., J. Am Chem. Soc . 1 18:6160-6166 (1996) and references therein, which are hereby 

incorporated by reference.) Second, the ladder-like structure of DNA is ideally suited to . 

^1 the application of a modular, self-assembly strategy. Finally, several oligonucleotide 

£0 

20 affinity resins are available commercially, allowing the experimental details of initial 

selection strategies to be determined without the need for costly oligonucleotide and affinity 
resin synthesis. As "monomers", the readily available salicylaldimines7, which have the 
following formula: 
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were chosen. Salicylaldiminesare well known to form coordination complexes with a wide 
variety of transition metals (Garnovskii et al., Coord. Chem. Rev .. 126:1-69 (1993), which 
is hereby incorporated by reference) and a substantial amount of structural information 
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about these complexes is available. Because of its tetrahedral coordination geometry with 
most salicylaldimines(Dreher et al., Naturforsch. 2? ., 42:707-712 (1987), which is hereby 
incorporated by reference) and compatibility with nucleic acids, divalent zinc was used as 
the metal for initial studies. 
5 Six salicylaldimines(8 - 13) displaying a variety of sidechain functionality 

and having the formulae: 





1 0 were synthesized by the condensation of salicylaldehyde and commercially available 
amines. The synthesis and characterization of 8 have been previously described in 
Inouye, S. Chem. Pharm. Bull. , 15:1540-1546 (1967), which is hereby incorporated by 
reference. The synthesis and characterization of compound 10 have been previously 
described in Duran et al., Synth. React. Inorg. Met.-Org. Chem ., 17:681-683 (1987), 

1 5 which is hereby incorporated by reference, and in Ramesh et al., J. Chem. Soc. Dalton , 

pp. 83-86 (1992), which is hereby incorporated by reference. The synthetic procedures for 
preparing compounds 8, 10, 11, and 13 are set forth below in Example 10. Given that 
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compounds 9 and 11 were synthesized in racemic form, a combinatorial library formed 
from these six salicylaldimines could be expected to provide a maximum of 36 unique 
bis(salicylaldiminato)zinccomplexes. These compounds were chosen in part because 
millimolar solutions of each could be prepared in buffered aqueous solution with 1% 
5 DMSO as cosolvent, a finding important to their successful utilization in a combinatorial 
library. 

Although the ability of bis(salicylaldiminato)-zinc(Blackborow et aL, T 
Chem. Res ., 1 19 (1978), which is hereby incorporated by reference) and -nickel 
(Schepartz et aL, J. Am.Chem. Soc , 1 1 1 :5976-6977 (1989), which is hereby incorporated 
10 by reference) complexes to self-assemble in halogenated solvents is well known, we felt it 
was important to verify that this could also occur in aqueous solution before attempting to 
construct a -combinatorial library. Therefore, the dependence of the proton NMR spectrum 
for 12 on ZnCl 2 concentration in D 2 0 was examined. Characteristic of slow-exchange*. 
t & phenomena, substantial line broadening of aromatic, and, to a lesser extent, side-chain 

;JJ 1 5 resonances, is observable on addition of substoichiometric quantities of ZnCl 2 . In addition, 

chemical shifts change markedly. All peaks coalesce into a single set of sharply defined 
n resonances as excess zinc is added. These observations do not provide high-resolution 

3 I 

Jf structural information about these complexes. Therefore, it is conceivable that the 

0" 

8 structure in solution is an addition complex (containing two salicylaldimines coordinated 

20 to ZnCl 2 ) rather than a chelate (see Bottino et aL, Zeitschrift fur Kristallographie , 1 87:71- 
77 (1989), which is hereby incorporated by reference). However, these observations do 
indicate that complexation between the salicylaldimineand Zn 2+ occurs, providing a single 
coordinated species. It was also observed that the addition of ZnCl 2 to a solution of an 
amine and salicylaldehyde in water can induce imine formation followed by 
25 complexation. This suggests that a three-stage equilibrium process could be constructed. 

A serious potential problem with the selection and amplification of a single 
compound from an equilibrating mixture of complexes is that of identifying the desired 
high-affinity species. First, if the selection is not complete for a single compound, the 
separation and identification of a potentially large number of complexes presents a 
30 significant analytical challenge. Furthermore, since reequilibration of complexes can occur 
once the affinity resin is removed, it might be expected that binding information would be 
lost. However, we found that hydrolysis of mixtures of complexes with trifluoroacetic acid 
followed by neutralization and derivatization with excess 2-naphthoyl chloride provided a 
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mixture of amide derivatives that was readily separable by standard reverse-phase HPLC. 
Application of this technique to the eluent from an affinity column would permit 
deconvolutionof the library to provide information about the identity of binding 
complexes. 

Example 9 — Results for Alternative Method 

With a series of monomers, verification of self-assembly in aqueous 
solutions, and a separation method in hand, the experimental tools were in place to conduct 
an affinity selection and amplification experiment. A standard solution of 0.5 mM 8-13 
(total concentration 3 mM in "monomer") was prepared in 10 mM Tris^HCl, 100 mM KC1, 
1% DMSO, pH 7.5. To determine the effect of zinc on the observed selection, samples 
were prepared either with or without 26 MM ZnCl 2 . Samples were allowed to incubate for 
one hour to allow an equilibrium mixture of complexes to form, and the latter were then 
individually added to affinity columns prepared from 30 mg of commercial poly(dT)- 
cellulose resin (Pharmacia Biotech) pre incubated with one A 2 6o unit (approximately 7 
nmol) of oligo (dA)i 2 -i8(01igo (dA)i 2 -i8 as obtained from Pharmacia Biotech is a mixture 
of homopolymeric, single-stranded DNA of length 12 to 18) in 10 mM Tris*HCl, 100 mM 
KC1, pH 7.5 for formation of double- stranded DNA (giving an approximate, average 
complex:basepair ratio of 85:1). Following a two-hour incubation of the libraries on the 
resin, solutions were eluted, lyophilized, and derivatized as described above. 

Figure 8 shows the results of affinity selection and amplification of our ■ 
initial self-assembled combinatorial library. With data normalized to values observed in the 
absence of zinc, it is clear that monomers 9 and 1 1 are most strongly retained on the affinity 
column in the presence of zinc (i.e., less of these monomers is observed following elution 
and derivatization). The observation that twice as much of derivatized monomer 12 is 
eluted from the affinity column in the presence of zinc as in the absence of zinc suggests 
that this monomer participates in some kind of binding interaction with double-stranded 
oligo(dA)-oligo(dT)in the absence of zinc that is not available to either the 12-Zn-12 
complex, or other complexes incorporating monomer 7. These results would then require 
deconvolutionof three possibilities for the strongest binding complex: 9-Zn-9, 11-Zn-ll, 
and 9-Zn-ll (including all stereoisomers) would all be candidates. However, results of 
control experiments rendered this question moot. 
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To ensure that selections observed were based on differential affinity for 
double-stranded DNA, rather than interactions with single-stranded DNA or with the 
underlying cellulose support, control experiments were conducted using single-stranded 
oligo(dT)-cellulose,and cellulose alone, under conditions identical to those for the double- 
5 stranded experiment. As shown in Figure 9, monomer 9 is retained on cellulose or 

oligo(dT)-celluloseto the same extent in the presence or absence of zinc, while 11 is more 
strongly retained on cellulose in the presence of zinc. This suggests that two out of the 
three possibilities for the strongest binding complex to double-stranded DNA (11-Zn-ll 
and ll-Zn-9) do not need to be considered, leaving 9-Zn-9 as likely the tightest DNA- 

1 0 binding complex. Other monomers are generally retained more strongly to single-stranded 
oligo(dT) cellulose in the absence of zinc. 

In order to verify that the results of the affinity selection experiment 
accurately reflected differences in binding affinities, UV binding titrations (as described in 
Krugh. Proc. Natl. Acad. Sci. USA , 69:191 1-1914 (1972)) were conducted for 

1 5 homodimeric complexes 10-Zn-10 (a presumed weak- or non-binding complex) and 

9-Zn-9 (a presumed strong-binding complex). Since both of these complexes have strong 
UV absorbances in the range of 290-350 nm while poly (dA)-poly(dT) does not, the 
concentration of complex was held constant in the presence of varying concentrations of 
DNA in each case. Using commercially available poly(dA)-poly(dT), saturable binding* 

20 was observable in each case. While any analysis of binding constants for these complexes 
is complicated by the homopolymeric, variable-length nature of the DNA used (providing 
multiple, identical binding sites), 9-Zn-9 bound to DNA with an apparent K D of 1 . 1 1 
micrornolar, significantly stronger than 10-Zn-10 (27.8 micromolar). A simple affinity 
selection experiment conducted as described above, but utilizing only monomers 9 and 10, 

25 further corroborated these observations. As shown in Figure 10, the ratio of derivatized 
10:9 eluted following incubation on double-stranded oligo(dT)-cellulose doubles on 
addition of zinc. 

Example 10 — Preparation of 8. 10, 11, and 13 
30 All solvents, salicylaldehyde, and amines were obtained from Aldrich 

Chemical Company and were used without further purification. NMR spectra were 
recorded on a Nicolet/GE QE-300 FT spectrometer operating at 300 ( ] H) or 75 ( 13 C) 
MHz. IR spectra were recorded using a Perkin-Elmer model 1600 FTIR. High-resolution 
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mass spectra (HRMS) analyses were carried out by the UC-Riverside Mass Spectrometry 
Facility, Riverside, California. 

Compound 9 was prepared as follows. To a solution of salicylaldehyde 
(3.0 mmol) and 2-(2-aminoethyl)-l -methyl pyrrolidine (3.0 mmol) in dichloromethane 
5 was added an excess (1 ml, or approximately 2.6 equivalents) triethylamine. Reaction 
was allowed to proceed with stirring at room temperature for two hours. Solvent and 
triethylamine were then removed in vacuo to provide essentially pure 9 in quantitative 
yield. 'H NMR (CDC1 3 , 300 MHz): 5 13.15 (s, 1H), 8.39 (s, 1H), 7.3 (m, 2H), 6.9 (m, 
2H), 3.62 (m, 2H), 3.1 (m, 1H), 2.34 (s, 3H), 2.1 (m, 4H), 1.7 (m, 4H). I3 C NMR (CDClj, 
10 75 MHz): 5 164.6, 161.2, 132.1, 131, 118.8, 118.4, 117,64.2,57.2,57.1,40.5,35.1,30.8, 
22. IR (thin film): 3788.7, 3695.5, 3661.2, 3636.2, 2941.2, 2840.0, 2776.5, 2666.1., 
2360.0, 2341.5, 1725.3, 1709.5, 1688.4, 1664.0, 1632.4, 1581.0, 1549.5, 1497.1, 1460.4, 

1415.5, 1370.4, 1344.2, 1280.3, 1212.4, 1150.9, 1117.2, 1031.6, 970.1, 891.3, 847.6, V 
755.8, 736.2, 668.0. HRMS: calc. 233.1528; obs. 233.1647. 

15 As with 9, 10 was prepared using 3-methoxypropylamine. *H NMR 

(CDCI3, 300 MHz): 5 13.57 (s, 1H), 8.19 (s, 1H), 6.55 (m, 2H), 6.4 (d, 1H, J=9 Hz), 6.38 
(t, 1H, J=9 Hz), 2.9 (t, 2H, J=7.5 Hz), 2.8 (t, 2H, J=7.5 Hz), 2.78 (s, 3H), 2.0 (q, 2H, 
J=7.5Hz). 13 C NMR (CDCI3, 75 MHz): 5 165, 161, 135, 133, 119, 118, 116,70,59,56, 
32. IR (thin film): 3845.6, 3788.5,3695.5, 3681.7, 1660.9,3638.0,2924.1,2870.0, 

20 1725.3, 1709.8, 1689.2, 1657.4, 1632.5, 1581.3, 1549.6, 1529.2, 1497.5, 1460.8, 1414.6, 

1384.6, 1337.6, 1279.6, 1202.0, 1151.0, 1118.1, 1031.3,974.1,849.0, 756.4, 736.4, 
640.1. HRMS: calc. 194.1055; obs. 194.1187 

As with 9, 11 was prepared from 2-aminomethylfuran. ! H NMR (CDCI3, 
300 MHz): 5 13.19 (s, 1H), 8.40 (s, 1H), 7.3 (m, 2H), 6.9 (m, 2H), 4.2 (m, 1H), 3.8 (m, 
25 4H), 1.95 (m, 4H). 13 C NMR (CDC1 3 , 75 MHz): 8166.31, 161.70, 132.64, 131.69, 119.28, 
1 18.89, 1 17.44, 78.61, 68.86, 63.91, 29.59, 26.29. IR (thin film): 3947.8, 3256.7, 3055.7, 

2871.7, 2733.4, 2661.7, 2061.3, 1945.4, 1912.4, 1790.0, 1633.1, 1580.7, 1496.2, 1461.8, 
1416.4, 1365.4, 1315.8, 1281.5, 1246.9, 1209.6, 1151.2, 1116.1, 1074.6, 1046.3, 1026.7, 
996.6,966.4, 922.1. HRMS: calc. 206.1182; obs. 206.1185 

30 As with 9, 13 was prepared from 4-fluorophenethylamine. ! H NMR 

(CDCI3, 300 MHz): 8 13.39 (s, 1H) 8.1 (s, 1H), 7.35 (m, 1H), 7.2 (m, 2H), 7.0 (m, 2H), 
6.9 (m, 2H), 3.8 (t, 2H, J=7.5 Hz), 3.0 (t, 2H, J=7.5 Hz). I3 C NMR (CDC1 3 , 75 MHz): 5 
168, 164, 161, 159, 135, 133, 132, 130, 118, 117.5, 115, 114.5,68, 61, 36.5. IR (thin 



film): 3044.6, 2926.6, 2852.4, 2732.1, 2662.9, 1887.1, 1633.2, 1607.6, 1580.0, 1509.6, 
1460.7, 1415.6, 1376.4, 1342.1, 1279.9, 1221.2, 1156.8, 1116.8, 1098.2, 1073.7, 1053.0, 
1016.1, 9687.6, 894.1, 870.1, 824.5, 778.1, 757.0, 736.3, 724.3, 705.5, 653.1, 640.8, 
573.8, 542.5, 509.5, 481.9, 467.6. HRMSxalc. 244.101 1; obs 244.1 129. 

Example 1 1 - NMR Titration of ZnCb into 10 

A 97 mM solution of 10 in deuterated buffer (PBS, pH 7.4) was prepared, 
and the ! H NMR spectrum measured at 300 MHz, 23 °C. Aliquots of a 1 M solution of 
ZnCh were then added to permit measurement of the NMR spectrum at 1 :0.25, 1 :2, and 
1 :3 molar ratios of 10:Zn 2+ . Plots of the observed NMR spectra are shown in Figure 1 1 . 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations can 
be made therein by those skilled in the art without departing from the spirit and scope of 
the invention which is defined by the following claims. 



